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Abstract 

We provide an alternative proof for the capacity region of the degraded Gaussian 
multiple-input multiple-output (MIMO) broadcast channel. Our proof does not use 
the channel enhancement technique as opposed to the original proof of Weingertan et. 
al. and the alternative proof of Liu et. al. Our proof starts with the single-letter 
description of the capacity region of the degraded broadcast channel, and directly 
evaluates it for the degraded Gaussian MIMO broadcast channel by using two main 
technical tools. The first one is the generalized de Bruijn identity due to Palomar et. al. 
which provides a connection between the differential entropy and the Fisher information 
matrix. The second tool we use is an inequality due to Dembo which lower bounds the 
differential entropy in terms of the Fisher information matrix. 



*This work was supported by NSF Grants CCF 04-47613, CCF 05-14846, CNS 07-16311 and CCF 07- 
29127. 
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1 Introduction 



The Gaussian multiple-input multiple-output (MIMO) broadcast channel consists of one 
transmitter and an arbitrary number of receivers, where the transmitter and receivers are 
equipped with multiple antennas. In this channel, each link between the transmitter and 
each receiver is modeled by a linear additive Gaussian channel. In general, the Gaussian 
MIMO broadcast channel is non-degraded, thus, we do not have a single-letter description 
of the capacity region. Despite this lack of a single-letter description, the capacity region 
of the Gaussian MIMO broadcast channel is successfully obtained in pQ. Subsequently, an 
alternative proof is given in [2] . In both proofs, the channel enhancement technique [1] is 
the main tool. 

Reference [1] obtains the capacity region of the Gaussian MIMO broadcast channel in 
three main steps. As the first step, pQ finds the capacity region of the degraded Gaussian 
MIMO broadcast channel. To this end, [1] first shows that as opposed to the scalar Gaussian 
broadcast channel [3], the entropy-power inequality falls short of providing a converse proof 
for the degraded Gaussian vector, i.e., MIMO, broadcast channel. This insufficiency of the 
entropy-power inequality is alleviated by the invention of the channel enhancement tech- 
nique p]. Using this technique, [1] constructs a new degraded Gaussian MIMO broadcast 
channel for each point on the boundary of the Gaussian rate regional of the original degraded 
channel, where the boundaries of the Gaussian rate regions of both channels intersect at that 
point, and the capacity region of the constructed degraded channel includes the capacity re- 
gion of the original one. Then, pQ completes the first step of the proof by showing that the 
Gaussian rate region is the capacity region of the constructed degraded channel, for which 
Bergmans' converse [3] can be adapted as opposed to the original channel. 

Secondly, [Tj considers the aligned Gaussian MIMO broadcast channel, where the trans- 
mitter and all receivers have the same number of antennas. This channel is not degraded, 
thus, there is no single-letter expression for its capacity region. Reference pQ shows that the 
achievable rate region obtained by using dirty paper coding (DPC), i.e., the DPC region, is 
the capacity region of the aligned channel. For this purpose, [TJ uses the channel enhance- 
ment technique one more time along with the capacity result obtained for the degraded 
Gaussian MIMO broadcast channel in the first step. 

Finally, pQ considers the general, not necessarily degraded or aligned, Gaussian MIMO 
broadcast channel and shows that the DPC region again amounts to the capacity region 
by using some limiting arguments in conjunction with the capacity result obtained for the 
aligned channel. 

Similar to the proof of pQ, the alternative proof in |2] uses the channel enhancement 
technique as well. The alternative proof in [2] can be divided into two parts. In the first 
part, [2] considers an optimization problem which is the maximization of the difference of two 

lr The Gaussian rate region refers to the achievable rate region obtained by superposition coding and 
successive decoding with Gaussian codebooks. 
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differential entropy terms (see Theorem 1 of [2]) which cannot be solved by a stand-alone use 
of the entropy-power inequality. Next, [2] provides two proofs for the fact that the Gaussian 
distribution is the maximizer of this optimization problem. In both proofs provided in [2], 
the channel enhancement technique is used. In the second part, [2] considers Marton's outer 
bound [4J for the general broadcast channel, and evaluates it for the aligned Gaussian MIMO 
broadcast channel by using the optimization problem solved in the first part. This evaluation 
yields the capacity region of the two-user aligned Gaussian MIMO broadcast channel. 

We note that though the proof in [2] is for the aligned, not necessarily degraded, Gaussian 
MIMO broadcast channel, if this proof is adapted to find the capacity region of the degraded 
Gaussian MIMO broadcast channel, again the channel enhancement technique will be needed. 
In particular, for the degraded case, the optimization problem solved in the first part of the 
proof in [2] will change slightly, however the need for channel enhancement will remain. In 
fact, the optimization problem needed for the degraded case is a special case of the original 
optimization problem solved in the first part of the proof in [2] , which is given in Corollary 4 
of 0. 

Here, we revisit the degraded Gaussian MIMO broadcast channel and provide an alter- 
native proof for the capacity region of this degraded channel, without using the channel 
enhancement technique. Though channel enhancement is an elegant technique that finds 
itself diverse applications, we believe that our proof is more direct. On the other hand, our 
proof is limited to the degraded case and does not seem to be extendable for the general case. 
In other words, to obtain the capacity region for the general case after finding the capacity 
region for the degraded case through our proof, one needs to use the channel enhancement 
technique []]. 

Our proof starts with the single-letter description of the capacity region of the degraded 
broadcast channel and directly evaluates it for the Gaussian MIMO case by using two main 
tools. The first one is the generalized de Bruijn identity due to [5] that states a connection 
between the differential entropy and the Fisher information matrix. The second tool we use 
is an inequality due to J6j[7] that gives a lower bound for the differential entropy in terms of 
the Fisher information matrix. 

Finally, our technique used in this alternative proof can be useful in other vector Gaussian 
multi-terminal information theory problems when proving the optimality of Gaussian random 
vectors. In fact, we have used a variant of this technique in proving the secrecy capacity 
region of the Gaussian MIMO multi- receiver wiretap channel in [8], and the secrecy capacity 
region of the Gaussian MIMO degraded compound multi- receiver wiretap channel in [9]. 
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2 Channel Model and Main Result 



The (aligned) degraded .fT-user Gaussian MIMO broadcast channel is defined by 

Y fe = X + N fc , k = l,...,K (1) 

where is Gaussian with covariance matrix k = 1, . . . ,K, and the channel input X 
and outputs {Y^}^ satisfy the Markov chain 

X -> Yi Yjf (2) 

which is equivalent to the covariance matrices {Sfc}^L 1 satisfying the following order 

O^Si^...^S^ (3) 

The channel input is subject to a covariance constraint 

E [XX T ] r< S (4) 

where we assume S y 0. The covariance constraint in (jlj) is more general than many other 
constraints including the trace constraint, in the sense that, once the capacity region is found 
for the constraint in (jlj), capacity regions arising from the use of other constraints subsumed 
by (0J can be obtained by using this capacity region pp. 

We next note that the definition of degradedness can be generalized to the case where 
receivers get arbitrary linear combinations of the channel inputs, i.e., 

Y fc = H fe X + N fc , k = l,...,K (5) 

The broadcast channel defined in ((SJ) is said to be degraded, i.e., satisfies the Markov chain in 
(j2J), if there exist matrices {Dk}k=\ such that D fc H fc = H fc+1 and D fc Dj ^ I [10]. However, 
once the capacity region of the aligned degraded Gaussian MIMO broadcast channel defined 
by ([!]) is obtained, the capacity region of the general degraded Gaussian MIMO broadcast 
channel defined by (JSJ) can be obtained by following the analysis given in Section 5 of [10J, 
which essentially relies on some limiting arguments. Since the key step to obtain the ca- 
pacity region of the general degraded Gaussian MIMO broadcast channel defined by (jSJ) is 
to establish the capacity region of the aligned degraded Gaussian MIMO broadcast channel 
defined by (OQ), here we consider only the latter channel model. 

The capacity region of the Gaussian MIMO broadcast channel is established in [1] for 
the most general case. For the degraded case, it is given as follows. 

Theorem 1 ([lj, Theorem 2) The capacity region of the K-user degraded Gaussian MIMO 
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broadcast channel is given by the union of rate tuples (R\, . . . , Rk) satisfying 



1, IZtiKi + E 



R k <- log '^rr" 1 — (6) 

where the union is over all positive semi- definite matrices {Kj}^£ 1 such that Y2iLi Kj = S. 

In the next section, we provide an alternative proof for this theorem for K = 2, and in 
Section [4] we extend this proof to the case K > 2. In both cases, we directly evaluate the 
capacity region of the degraded broadcast channel which is stated in the following theorem, 
for the Gaussian MIMO channel at hand. 

Theorem 2 ([llj, Theorem 15.6.2) The capacity region of the degraded broadcast chan- 
nel is given by the union of rate tuples (Rx, . . . , Rk) satisfying 

R k <I(U k ;Y k \U k+1 ), k = l,...,K (7) 

where Uk+i — <fi, U\ — X, and the union is over all {U k }^ =2 ,X such that 

U K -> . . . -> U 2 X Y 1 -> . . . Y K (8) 



3 Proof of Theorem ffl for K = 2 



3.1 Background 

We need some properties of the Fisher information and the differential entropy, which are 
provided next. 

Definition 1 (|8J, Definition 3) Let (U, X) be an arbitrarily correlated length-n random 
vector pair with well-defined densities. The conditional Fisher information matrix o/X given 
U is defined as 



J(X|U) = E [p(X|U)p(X|U) T ] 



(9) 



where the expectation is over the joint density f(u,x), and the conditional score function 
p(x|u) is 



p(x|u) = Vlog/(x|u) 



<91og/(x|u) 



<91og/(x|u) 



(10) 



dxi dx n 
We first present the conditional form of the Cramer- Rao inequality, which is proved in [8] . 
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Lemma 1 (|8J, Lemma 13) Let U, X be arbitrarily correlated random vectors with well- 
defined densities. Let the conditional covariance matrix of X be Cov(X|U) y 0, then we 
have 

J(X|U) t Cov(XlU)- 1 (11) 

which is satisfied with equality if (U, X) is jointly Gaussian with conditional covariance 
matrix Cov(X|U). 

The following lemma will be used in the upcoming proof. The unconditional version of 
this lemma, i.e., the case T = </>, is proved in Lemma 6 of [8J. 

Lemma 2 (|8J, Lemma 6) Let T,U,Vi,V2 be random vectors such that (T,U) and 
(Vi, V 2 ) are independent. Moreover, let Vi, V 2 be Gaussian random vectors with covariance 
matrices Si, S 2 such that -< S x ^ S 2 . Then, we have 

J- X (U + V 2 |T) - S 2 y J-\U + Vi|T) - Si (12) 

The following lemma will also be used in the upcoming proof. 

Lemma 3 (|8J, Lemma 8) Let K l5 K 2 be positive semi-definite matrices satisfying ^ 
Ki ^ K 2; and f(K) be a matrix-valued function such that f(K) y for Ki ^ K ^ K 2 . 
TTien, we /iave 

/ f(K)dK > (13) 

The following generalization of the de Bruijn identity [T2j,[T3] is due to [5], where the un- 
conditional form of this identity, i.e., U = 0, is proved. Its generalization to this conditional 
form for an arbitrary U is rather straightforward, and is given in Lemma 16 of [8]. 

Lemma 4 (|8J, Lemma 16) Let (U,X) be an arbitrarily correlated random vector pair 
with finite second order moments, and also be independent of the random vector N which is 
zero-mean Gaussian with covariance matrix S^r >- 0. Then, we have 

V SiV /i(X + N|U) = ^J(X + N|U) (14) 

The following lemma is due to [HIE] which lower bounds the differential entropy in terms 
of the Fisher information matrix. 

Lemma 5 ([HIE]) Let (U, X) be an (n + 1) -dimensional random vector, where the condi- 
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tional Fisher information matrix o/X, conditioned on U , exists. Then, we have 

/i(X|t/)>ilog(2^e)"|J- 1 (X|/7)| (15) 

In [6j[7], the unconditional version of this lemma, i.e., U — 0, is provided. A proof for its 
generalization to this conditional form is given in Appendix |A] 



3.2 Proof for K = 2 

We first rewrite the capacity region of the degraded broadcast channel given in Theorem [2] 
for two users as a union of rate pairs (f?i, R2) satisfying 

fli</(X;li|E0 (16) 
R 2 <I(U;Y 2 ) (17) 

where we dropped the subscript of the auxiliary random variable U 2 and denoted it simply 
as U. The involved random variables satisfy the Markov chain U — > X — > Y\ — Y Y 2 . To 
obtain the capacity region of the degraded Gaussian MIMO broadcast channel, we need to 
evaluate this region. In particular, we will show that the optimal random vector (U, X) that 
exhausts this region is Gaussian, and the corresponding capacity region is given by the union 
of rate pairs {R\,R 2 ) satisfying 

fl,<ilogJ^ii (18) 

where the union is over all K such that -< K -< S. We note that the region described by 
(I18p - (ll9p comes from Theorem [T] by dropping the subscript of Kx and denoting it simply as 
K. 

We begin with the bound on R 2 . Starting from (fTT|) . we get 



R 2 <I(U;Y 2 ) (20) 

= h(Y 2 ) - h(Y 2 \U) (21) 

< X - log(27re) n |S + S 2 | - h(Y 2 \U) (22) 

where the inequality in ( 1221) comes from the maximum entropy theorem [TT] . We now bound 
h(Y 2 \U) in (!22l) . We first get an upper bound as 

h(Y 2 \U) < h(Y 2 ) < i log(2^e) n |S + E 2 | (23) 
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where the first inequality comes from the fact that conditioning cannot increase entropy, 
and the second inequality is due to the maximum entropy theorem [11]. Furthermore, using 
Lemma El we can get the following lower bound for h(Y 2 \U) 

h(Y 2 \U) > ilog(2 7 re)"|J- 1 (X + N 2 |t/)| (24) 

We next define the following function 

r(t) = ^log(27re) n |A(t) + £ 2 |, < t < 1 (25) 

where A(t) is given as 

A(t) = (1 - t) [J- X (X + N 2 |C7) - S 2 ] + tS (26) 



We first note that 

J-\X + N 2 \U) - £ 2 ^ Cov(X + N 2 \U) - S 2 (27) 

= Cov(X|[/) (28) 

r< Cov(X) (29) 

=< S (30) 

where (|2?]) is a consequence of Lemma (U and (129]) comes from the fact that the conditional 
covariance matrix is smaller than the unconditional one in the positive semi-definite ordering 
sense. This implies that for any < t < 1, A(i) satisfies 

J- 1 (X + N 2 |C/)-S 2 ^ A(t) (31) 

Using r(t), bounds in ( 1231) and ( 124T) can be rewritten as 

r(0) < h{Y 2 \U) <r(l) (32) 

Since r(i) is continuous in t [8], due to the intermediate value theorem, there exists a t* such 
that 

r(t*) = h(Y 2 \U) = l - log(27re) n |A(f ) + S 2 | (33) 
where A(f) satisfies (Ell). Plugging ([331) into ([221) yields 

R2 < - log , J S | S2 ' , (34) 

2 6 |A(t*) + S 2 | v ; 

which is the desired bound on R 2 given in (TT9j) . 
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We now obtain the desired bound on R\. To this end, using (|3"T|) and Lemma [2j we get 



A(0 h J- X (X + N 2 |?7) -S 2 (35) 

b J _1 (X + N|£/) -Sjv (36) 

for any Gaussian random vector N with covariance matrix where Sat ^ S 2 . The order 
in (136]) is equivalent to 

(A(r) + S^)- X ^ J(X + N|C7) (37) 
Next, starting from (}T6]) . we get 



^^/(XjYxItT) (38) 

= / i (Y 1 |[/)-ilog(27re) n |S 1 | (39) 

= /i(Y!|i7) - /i(Y 2 |E7) + h(Y 2 \U) - 1 log(2 7 re) n |S 1 | (40) 

= hiY^U) - fc(Y 2 |*7) + ~ log |A(0 + S 2 | - ~ log |Ei| (41) 

= -\ J2 J(X + N|l7)dE* + 1 log |A( ^+ S2 ' (42) 

<-i / s S2 (A(r) + v N rdn N + i tog |A(t ^+ Sal (43) 
i |A(fO + Si| i |A(r) + s 2 | 

= - log — ; — ; r + - log 1 — —r- — i 1 44) 

1, \A(t* ) + Ei| ,, e , 

= ^ log 1 (45) 

where (J4T| is due to f )33|) . f|42|) is obtained by using Lemma HI and (j4"3"]) is due to (j3"7|) and 

Lemma EJ Since ( Ho]) is the desired bound on Ri given in ( fT8"j) . this completes the proof. 



4 Extension to the i^-user Case 

We now extend our alternative proof presented in the previous section to the case K > 2. 
For that purpose, we need the following lemma due to [8] in addition to the tools introduced 
in Section 13.11 

Lemma 6 (|8J, Lemma 17) Let (V,U,X) be length-n random vectors with well-defined 
densities. Moreover, assume that the partial derivatives o//(u|x, v) with respect to Xi, i = 
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1, . . . , n exist and satisfy 



max 

Ki<n 



d/(u|v,x) 



dxi 



<g{u) 



(46) 



for some integrable function g(u) . Then, if (V, U, X) satisfy the Markov chain V — > U — > X, 
we have 



j(x|u) y J(x|v) 



(47) 



First, following the proof in Section [3721 we can show the existence of a covariance matrix 
A K such that 



+ N K \U K ) - =< a* r< s 

h(Y K \U K ) = X - log(27re) n |A K + Ejf | 
/i(Yjr_i|Efc) < ^^^riAx + S^I 



(48) 
(49) 

(50) 



Since we have 



h(Y K )< -log(27re) n |S + S^| 



(51) 



from the maximum entropy theorem [11] , we can get the desired bound on Rx as follows 

(52) 



Rk<I(U k ;Y k )< hog ' S+ ^'| 



When K = 2 as in Section I3.2[ showing the existence of an A^ having the properties 
listed in (|48|) - (|50|) is sufficient to conclude the proof. However, when K > 2, we need an 
additional tool, which is Lemma El and using this tool we need to repeat this step until we 
are left with showing the desired bound on the first user's rate R\. We now present the basic 
step that needs to be repeated. In particular, we now show the existence of a covariance 
matrix Ak-i such that 



J- X (X + T$k-x\U K -\) ~ Ejc-i =< ^k-i ■< A 



K 



h(Y K ^\U K ^) = - log(27re) n |A X -i + E*--i| 
h{Y K _ 2 \U K „ x ) < Uog{2ne) n \A K ^ + E x _ 2 | 



(53) 
(54) 

(55) 
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We first note the following order 



A K h J- l (X + -N K \U K )-X K (56) 

h J-^X + Njf-ilt/jf) - Sjr-i (57) 

b J-^X + Njf-ilC^-i) - Sx-i (58) 

where (156]) is due to ( 1481) , ( |57|) comes from Lemma [2J and ( l58l) follows from Lemma [6] as we 
can get 

J(X + Njsr-ilC^r-i) h J(X + N*_i|tfr) (59) 

by noting the Markov chain Uk —> Uk-i — > X + Nr-_i. 

Next, we consider the following lower bound on h{Y k-i\Uk-i) which is due to Lemma [5] 

^log(2^e)"|J- 1 (X + N i ,_ 1 |^_ 1 )| < }i(Yk-i\Uk-i) (60) 

Moreover, we can get the following upper bound 

h{Y K - X \U K - X ) = h(Y K ^\U K ^, U K ) (61) 

<h(Y K ^\U K ) (62) 

< ilog(27re) n |A K + S Ji :_ 1 | (63) 

where (16"TT) is due to the Markov chain Uk — > Uk-i — > Yk-i, ( |62l) comes from the fact that 
conditioning cannot increase entropy, and (163]) is due to fl50l) . 
We now define the following function 

r K -x(t) = ~ log(2 7 re) n |A K _ 1 (t) + S K -i|, < t < 1 (64) 
where A#_i(t) is given by 

Ajf_i(t) = (1 - t) [J- X (X + N^xIC/k-i) - Ejr-i] + tA K (65) 
Using r K -i(t), we can recast bounds on h(Y k-i\Uk-i) m (JHDD and (16"3"|) as 

rjc-i(O) < ^(Y^il^i) < rjf_i(l) (66) 

Since r^_i(t) is continuous in t [8], due to the intermediate value theorem, there exists a t* 
such that ^(Yx-iI^a'-i) = Tjc-\{t*), i-e., 

^(Y^xl^O = l - log(27re) ft |A Jf _ 1 + £ X -i| (67) 
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where we define Ak-x = A^_i(f ). Thus, we established Furthermore, it is clear that 
we also have 053J because of ([58]), (J65J) and < t* < 1. 

We now show (1551) . To this end, we note the following order 

A K -i h J -1 (X + Nk-i|Z7k-i) - S/r-i (68) 
b J-^X + NI^) -Sjv (69) 

for any Gaussian N with covariance matrix E^r d due to Lemma |2j The order in fl69|) 

is equivalent to 

J(X + N|C/^_ 1 )b(A^_ 1 + S JV )- 1 , S^S^ (70) 

We now consider h(Y k-2\Uk-x) as follows 

/j(Yjf_ 2 |L^_a) = h{Y K ^\U K ^) - fc(Y*_i|Lfc_i) + ^(Y^lC/x-i) (71) 
= ~ I K 1 J(X + N|C/)dS JV + ilog(27re) ri |AK_ 1 + S K _ 1 | (72) 

<-\j K \a K -x + ^N^d^N + i log(2 7 re) n |A iC _ 1 + S K _ X | (73) 

= llog(27re)"|A / ,_ 1 + S^ 2 | (74) 

where f [72"]) comes from Lemma S] and f[M[) . and ([73]) is due to ([70]) and Lemma [31 Thus, we 
showed ( 155]) as well. Also, we can establish the desired bound Rr-x as follows 

Rk-x<I{Uk-x^k-i\Uk) (75) 
= h{Y K . x \U K ) - h(Y K ^\U K ^) (76) 

< \]Dg(2<Ke) n \A K + V K -. x \-h{Y K -x\UK-x) (77) 



1 |A^ + S A -_x| 

n lo s n — r ( 75 



2 
1 

2 |Ax-i + 

where ([76]) comes from the Markov chain Uk — > Uk-x — > Yk-i, ( 177]) comes from ([50]) . and 
(JZSJ is due to (El). 

As of now, we outlined the basic step that needs to be repeated until we are left with 
getting the desired bound on the first user's rate R\. Following the analysis from ([53]) to 
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( I74j) . we can show the existence of covariance matrices A k , for k = 2, . . . , K, such that 



J- X (X + N k \U k ) - ± A k ± A k+l (79) 

KY k \U k ) = ilog(27re)"|A fc + S fc ] (80) 

^(Y^lC/fc) < ~ log(27re) n |A fc + S^l (81) 

where we set A^+i = S. Using these relations, we can get the bound on R k for any k = 
2, . . . , K, as 

R k <I{U k -Y k \U k+l ) (82) 

= h(Y k \U k+1 )-h(Y k \U k ) (83) 

< llog(27re) n \A k+1 + X k \-h(Y k \U k ) (84) 



2 
1 

2 ° |Afe + Sfe 



1 , |Afc + i + , . 

l0 § I A I ^ i ( 85 ) 



where we set Uk+i = 4>- The equality in f )83|) comes from the Markov chain £4 + i — )■ U k — > Y k , 
f!81|) is obtained by using (I8"T1) . and fl8o^) is due to flHOl . For = 1, we can get the bound on 
Ri as 

fl x < /(X^l^) (86) 
= / i (Yi|f/ 2 )-^log(2^e) ri |S 1 | (87) 

< - lor 



2 IS 



where (|88|) comes from f|8T|) . Finally, we define = A^ +1 — A& for k = 1, . . . , K where 
Ai = <fi, Ak+i = S, and plug these into (185]) - (IBB!) which yields the expressions in Theorem [TJ 



5 Conclusions 

We provide an alternative proof for the capacity region of the degraded Gaussian MIMO 
broadcast channel. As opposed to the existing proofs in [TJ|2], our proof does not use the 
channel enhancement technique [TJ. Our proof starts with the single-letter description of 
the degraded broadcast channel's capacity region and directly evaluates it for the degraded 
Gaussian MIMO case. This evaluation is carried out by means of two main technical tools. 
The first one is the generalized de Bruijn identity that gives a connection between the 
differential entropy and the Fisher information [5]. The second one is an inequality due 
to [Hl[7] that lower bounds the differential entropy in terms of the Fisher information matrix. 
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A Proof of Lemma 
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We define the function /(e) as follows 

/(e) = /i(X + v ^N|f/)-ilog|(2^e)(J- 1 (X|^) + eS)|, e>0 (89) 

We need to prove that /(0) > 0. We will show that /(e) is monotonically decreasing in e, 
and that lim^oo /(e) = 0. This will prove /(0) > 0. To this end, we introduce the following 
lemma which will be used subsequently. 

Lemma 7 (|8J, Corollary 4) Let X, Y, U be length-n random vectors and let the density 
for any combination of these random vectors exist. Moreover, let X and Y be conditionally 
independent given U. Then, we have 

J(X + Y|U) H [J(XIU)- 1 + J(YIU)- 1 ] 1 (90) 

Fix ei, €2 such that < e\ < e 2 . Using Lemma HI we have 

h(X + v ^N\U)-h(X + ^T 1 N\U) = l [ 2 J(X + T|f/)d£ T (91) 

where T is a Gaussian random vector with covariance matrix St such that eiS ^ St ^ ^S, 
and independent of (U, X). Using Lemma [7] in conjunction with Lemma HJ we get 

J(X + T\U)± [J-\X\U) + S T ] _1 (92) 

Plugging ( 192]) into (f9T|) and invoking Lemma [3l we get 

MX + - h(X + -Je^N\U) < i log 0»> 

Rearranging ( 1931) yields 

/(e 2 )</(ei), ei<e 2 (94) 
which proves that /(e) is monotonically decreasing in e. 
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We now consider upper and lower bounds on /(e). We have the following upper bound 
on /(e) 



/(e) = h(X + y/eN\U) --log | (2/re) (J-\X\U) + eS) | (95) 

1, iK + eSl 

-2 lQg |J-W) + ^| (96) 

1 | S -l/2 KS -l/2 + 6l | 

~ 2 ° g |S-V2j-i(x|C/)S-V2 + e i| (97 ^ 



i=l 



where fl96l) comes from the maximum entropy theorem [TT] and K denotes the covariance 
matrix of X. In f )98|) . we denote the eigenvalues of S _1/,2 KS -1 / 2 with {Aj}™ =1 , and of 
S _1 / 2 J _1 (X|t/)E _1 / 2 with {Aj}" =1 . Furthermore, we have the following lower bound on /(e) 

/(e) = h(X + sTeN\U) - i log |(2ire) (J-'(X|£/) + eS) | (99) 

1 e™ 
= 2 bg |S-V2j-i(x|C/)S-V2 + e i| (101) 

=^«n^ (io2) 



where fllOOp comes from the fact that conditioning cannot increase entropy, and in (|102p . we 
denote the eigenvalues of S- 1 / 2 J- 1 (X|[/)S- 1 / 2 with {Xi}f =v Comparison of (EH]) and ( 1T021) 
yields 

1 n 1 n \ 4- 

i=l i=l 

Taking the limit as e — > oo yields lim e _ s>00 /(e) = 0. Combining this with the fact that /(e) 
decreases monotonically in e yields /(0) > 0, and consequently, 

h(K\U) > ilog(27re)"|J- 1 (X|[/)| (104) 

completing the proof. 
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